# Reinforcement Learning Reasoning
Acereason Nemotron 14B GGUF
Other
A math and programming reasoning model trained with reinforcement learning, excelling in multiple benchmark tests
Large Language Model
Transformers English

A
unsloth
1,417
4
Open Reasoner Zero 7B
MIT
Open Reasoner Zero is an open-source solution for large-scale reinforcement learning based on foundational models, focusing on scalability, simplicity, and ease of use for large-scale reasoning-oriented reinforcement learning.
Large Language Model
Transformers

O
Open-Reasoner-Zero
776
28
Deepseek R1 Zero
MIT
DeepSeek-R1 is the first-generation reasoning model developed by DeepSeek, trained through reinforcement learning, excelling in mathematics, coding, and reasoning tasks.
Large Language Model
Transformers

D
deepseek-ai
4,034
905
Featured Recommended AI Models